Disentangling the Structure of Tables in Scientific Literature

نویسندگان

  • Nikola Milosevic
  • Cassie Gregson
  • Robert Hernandez
  • Goran Nenadic
چکیده

Within the scientific literature, tables are commonly used to present factual and statistical information in a compact way, which is easy to digest by readers. The ability to "understand" the structure of tables is key for information extraction in many domains. However, the complexity and variety of presentation layouts and value formats makes it difficult to automatically extract roles and relationships of table cells. In this paper, we present a model that structures tables in a machine readable way and a methodology to automatically disentangle and transform tables into the modelled data structure. The method was tested in the domain of clinical trials: it achieved an F-score of 94.26% for cell function identification and 94.84% for identification of inter-cell relationships.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Role of Scientific Optimism in the Teachers’ Empowerment

Objectives: The optimism, happiness, creativity, emotional intelligence, self- awareness and, hope are the concepts that the positivist psychologists have addressed in recent decays. In this approach, the positive aspects come up strongly and more significant in different aspects of human life, with the scientific views and method. This study investigates the role of scientific optimism in...

متن کامل

Plain Answers to Several Questions about Association/Independence Structure in Complete/Incomplete Contingency Tables

In this paper, we develop some results based on Relational model (Klimova, et al. 2012) which permits a decomposition of logarithm of expected cell frequencies under a log-linear type model. These results imply plain answers to several questions in the context of analyzing of contingency tables. Moreover, determination of design matrix and hypothesis-induced matrix of the model will be discusse...

متن کامل

An Investigation into the Structure of Research Articles and Writing Guidelines in the Iranian Knowledge and Information Science Journals

Background and Aim: The purpose of this research is investigating the structure of research articles in the Iranian knowledge and information science journals (peer reviewed journals). In the next step, the writing guidelines in the scientific journals websites that designed to introduce desired structure of a scientific paper are studied. Methods: The research was survey with analytical approa...

متن کامل

Partial Association Components in Multi-way Contingency Tables and Their Statistiical Analysis

In analyses of contingency tables made up of categorical variables, the study of relationship between the variables is usually the major objective. So far, many association measures and association models have been used to measure  the association structure present in the table. Although the association measures merely determine the degree of strength of association between the study varia...

متن کامل

Sustainable Supply Chain Network Design: A Review on Quantitative Models Using Content Analysis

The purpose of this paper is to develop a systematic literature review on the subject of sustainable supply chain network design during 1990-2016, through a review of 261 papers. In this study, qualitative technique for conducting a systematic literature review was used. To systematize and make the literature review more accurate, content analysis method was used that include data collect...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016